Coding theorems for individual sequences

نویسنده

  • Jacob Ziv
چکیده

A quantity called the finite-state complexity is assigned to every infinite sequence of elements drawn from a finite set. This quantity characterizes the largest compression ratio that can be achieved in accurate transmission of the sequence by any finite-state encoder (and decoder). Coding theorems and converses are derived for an individual sequence without any probabilistic characterization, and universal data compression algorithms are introduced that are asymptotically optimal for all sequences over a given alphabet. The finite-state complexity of a sequence plays a role similar to that of entropy in classical information theory (which deals with probabilistic ensembles of sequences rather than an individual sequence). For a probabilistic source, the expectation of the finite state complexity of its sequences is equal to the source's entropy. The finite state complexity is of particular interest when the source statistics are

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

TAUBERIAN THEOREMS FOR THE EULER-NORLUND MEAN-CONVERGENT SEQUENCES OF FUZZY NUMBERS

Fuzzy set theory has entered into a large variety of disciplines of sciences,technology and humanities having established itself as an extremely versatileinterdisciplinary research area. Accordingly different notions of fuzzystructure have been developed such as fuzzy normed linear space, fuzzytopological vector space, fuzzy sequence space etc. While reviewing theliterature in fuzzy sequence sp...

متن کامل

P87: The Role of the Long Non-Coding RNA Sequences (LncRNAs) in Neurological Disorders

Precise interpretation of the transcriptome sequences in the several species showed that the major part of genome has been transcribed; however, just a few amounts of the transcription sequences have open-reading frames which are conversed during the evolution. So, it is unlikely that many of the transcribed sequences code the proteins. Among the all human non-coding transcripts, at least 10000...

متن کامل

Tracking the Sequences of Patient-Therapist Dialogues by Coding Responses during Integrative Psychotherapy

Aim: The Assimilation of Problematic Experiences Scale (APES) for the coding of client responses and the Process Focused Conversation Analysis (PFCA) for coding therapist responses were applied to transcripts of a successful case of integrative psychotherapy of depression. Methods: the research method of the present research was case study. Dialogues (150) between a therapist and client in one ...

متن کامل

Relationship among Complexities of Individual Sequences over Countable Alphabet

This paper investigates some relations among four complexities of sequence over countably infinite alphabet, and shows that two kinds of empirical entropies and the self-entropy regarding a finite state source are asymptotically equal and lower bounded by the muximun number of phrases in distinct parsing of the sequence. Some connections with source coding theorems are also investigated. Furthe...

متن کامل

Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467

Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • IEEE Trans. Information Theory

دوره 24  شماره 

صفحات  -

تاریخ انتشار 1978